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ATG AGT GCT ACC CGG ATT CAA GCA GTT TAT CGT GAT ACG GGG GTA GAG 
Met Ser Ala Thr Arg He Gin Ala Val Tyr Arg Asp Thr Gly Val Glu 
1 5 10 15 

GCT TAT CGT GAT AAT CCT TTT ATC GAG GCC TTA CCA CCA TTA CAA GAG 
Ala Tyr Arg Asp Asn Pro Phe He Glu Ala Leu Pro Pro Leu Gin Glu 
20 25 30 

TCA GTG AAT AGT GCT GCA TCA CTG AAA TCC TCT TTA GAG CTT ACT TCC 
Ser Val Asn Ser Ala Ala Ser Leu Lys Ser Ser Leu Gin Leu Thr Ser 

35 40 45 

TCT GAG TTG CAA AAG TCC CGT GTT ATC AGA GCT CAT ACC ATT TGT CGT 
Ser Asp Leu Gin Lys Ser Arg Val He Arg Ala His Thr He Cys Arg 
50 55 60 

ATT CCA GAT GAC TAT TTT CAG CCA TTA GGT ACG CAT TTG CTA CTA AGT 
He Pro Asp Asp Tyr Phe Gin Pro Leu Gly Thr His Leu Leu Leu Ser 
S5 70 75 80 

GAG CGT ATT TCG GTC ATG ATT CGA GGT GGC TAC GTA GGC AGA AAT CCT 
Glu Arg He Ser Val Met He Arg Gly Gly Tyr Val Gly Arg Asn Pro 
85 90 95 

AAA ACA GGA GAT TTA CAA AAG CAT TTA CAA AAT GGT TAT GAG CGT GTT 
Lys Thr Gly Asp Leu Gin Lys His Leu Gin Asn Gly Tyr Glu Arg Val 
100 105 110 

CAA ACG GGA GAG TTG GAG ACA TTT CGC TTT GAG GAG GCA CGA TCT ACG 
Gin Thr Gly Glu Leu Glu Thr Phe Arg Phe Glu Glu Ala Arg Ser Thr 

115 120 125 

GCA CAA AGC TTA TTG TTA ATT GGT TGT TCT GGT AGT GGG AAG ACG ACC 
Ala Gin Ser Leu Leu Leu He Gly Cys Ser Gly Ser Gly Lys Thr Thr 
130 135 X40 

TCT CTT CAT CGT ATT CTA GCC ACG TAT CCT CAG GTG ATT TAC CAT CGT 
Ser Leu His Arg He Leu Ala Thr Tyr Pro Gin Val He Tyr His Arg 
145 ISO 155 

GAA CTC AAT GTA GAG CAG GTG GTG TAT TTG AAA ATA GAC TGC TCG CAT 
Glu Leu Asn Val Glu Gin Val Val Tyr Leu Lys He Asp Cys Ser His 
165 170 175 

AAT GGT TCG CTA AAA GAA ATC TGC TTG AAT TTT TTC AGA GCG TTG GAT 
Asn Gly Ser Leu Lys Glu He Cys Leu Asn Phe Phe Arg Ala Leu Asp 
180 185 190 

CGA GCC TTG GGC TCG AAC TAT GAG CGT CGT TAT GGC TTA AAA CGT CAT 
Arg Ala Leu Gly Ser Asn Tyr Glu Arg Arg Tyr Gly Leu Lys Arg His 
195 200 205 

GGT ATA GAA ACC ATG TTG GCT TTG ATG TCG CAA ATA GCC AAT GCA CAT 
Gly He Glu Thr Met Leu Ala Leu Met Ser Gin He Ala Asn Ala His 
210 215 220 

GCT TTA GGG TTG TTG GTT ATT GAT GAA ATT CAG CAT TTA AGC CGC TCT 
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Ala Leu Gly Leu Leu Val lie Asp Glu lie Gin His Leu Ser Arg Ser 
225 230 235 240 



CGT TCG GGT GGA TCT CAA GAG ATG CTG AAC TTT TTT GTG ACG ATG GTG 
Arg Ser Gly Gly Ser Gin Glu Met Leu Asn Phe Phe Val Thr Met Val 
245 250 255 

AAT ATT ATT GGC GTA CCA GTG ATG TTG ATT GGT ACC OCT AAA GCA CGA 
Asn lie lie Gly Val Pro Val Met Leu lie Gly Thr Pro Lys Ala Arg 
260 265 270 

GAG ATT TTT GAG GCT GAT TTG CGG TCT GCA CGT AGA GGG GCA GGG TTT 
Glu lie Phe Glu Ala Asp Leu Arg Ser Ala Arg Arg Gly Ala Gly Phe 
275 280 285 

GGA GCT ATA TTC TGG GAT CCT ATA CAA CAA ACG CAA CGT GGA AAG CCC 
Gly Ala lie Phe Trp Asp Pro lie Gin Gin Thr Gin Arg Gly Lys Pro 
290 295 300 

AAT CAA GAG TGG ATC GCT TTT ACG GAT AAT CTT TGG CAA TTA GAG CTT 
Asn Gin Glu Trp lie Ala Phe Thr Asp Asn Leu Trp Gin Leu Gin Leu 
305 310 315 320 

TTA CAA CGC AAA GAT GCG CTG TTA TCG GAT GAG GTC CGT GAT GTG TGG 

Leu Gin Arg Lys Asp Ala Leu Leu Ser Asp Glu Val Arg Asp Val Trp 

325 330 335 

TAT GAG CTA AGC CAA GGA GTG ATG GAG ATT GTA GTA AAA CTT TTT GTA 
Tyr Glu Leu Ser Gin Gly Val Met Asp lie Val Val Lys Leu Phe Val 
340 345 350 

CTC GCT CAG CTC CGT GCG CTA GCT TTA GGC AAT GAG CGT ATT ACC GCT 
Leu Ala Gin Leu Arg Ala Leu Ala Leu Gly Asn Glu Arg lie Thr Ala 
355 360 365 

GGT TTA TTG CGG CAA GTG TAT CAA GAT GAG TTA AAG CCT GTG CAC CCC 
Gly Leu Leu Arg Gin Val Tyr Gin Asp Glu Leu Lys Pro Val His Pro 
370 375 380 

ATG CTA GAG GCA TTA CGC TCG GGT ATC CCA GAA CGC ATT GCT CGT TAT 
Met Leu Glu Ala Leu Arg Ser Gly lie Pro Glu Arg lie Ala Arg Tyr 

385 390 395 400 

TCT GAT CTA GTC GTT CCC GAG ATT GAT AAA CGG TTA ATC CAA CTT CAG 
Ser Asp Leu Val Val Pro Glu lie Asp Lys Arg Leu lie Gin Leu Gin 
405 410 415 

CTA GAT ATC GCA GCG ATA CAA GAA CAA ACA CCA GAA GAA AAA GCC CTT 

Leu Asp He Ala Ala He Gin Glu Gin Thr Pro Glu Glu Lys Ala Leu 
420 425 430 

CAA GAG TTA GAT ACC GAA GAT CAG CGT CAT TTA TAT CTG ATG CTG AAA 
Gin Glu Leu Asp Thr Glu Asp Gin Arg His Leu Tyr Leu Met Leu Lys 
435 440 445 

GAG GAT TAC GAT TCA AGC CTG TTA ATT CCC ACT ATT AAA AAA GCG TTT 
Glu Asp Tyr Asp Ser Ser Leu Leu He Pro Thr He Lys Lys Ala Phe 
450 455 460 
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AGC CAG AAT CCA ACG ATG ACA AGA CAA AAG TTA CTG CCT CTT GTT TTG 
Ser Gin Asn Pro Thr Met Thr Arg Gin Lys Leu Leu Pro Leu Val Leu 
465 470 475 480 

CAG TGG TTG ATG GAA GGC GAA ACG GTA GTG TCA GAA CTA GAA AAG CCC 
Gin Trp Leu Met Glu Gly Glu Thr Val Val Ser Glu Leu Glu Lys Pro 
485 490 495 

TCC AAG AGT AAA AAG GTT TCG GOT ATA AAG GTA GTC AAG CCC AGC GAC 
Ser Lys Ser Lys Lys Val Ser Ala lie Lys Val Val Lys Pro Ser Asp 

SOD 505 510 

TGG GAT AGC TTG CCT GAT ACG GAT TTA CGT TAT ATC TAT TCA CAA CGC 
Trp Asp Ser Leu Pro Asp Thr Asp Leu Arg Tyr lie Tyr Ser Gin Arg 
515 520 525 

CAA CCT GAA AAA ACC ATG CAT GAA CGG TTA AAA GGG AAA GGG GTA ATA 
Gin Pro Glu Lys Thr Met His Glu Arg Leu Lys Gly Lys Gly Val He 
530 535 ■ 540 

GTG GAT ATG GCG AGC TTA TTT AAA CAA GCA GGT TAG CC 
Val Asp Met Ala Ser Leu Phe Lys Gin Ala Gly * 
545 550 555 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 556 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Ser Ala Thr Arg He Gin Ala Val Tyr Arg Asp Thr Gly Val Glu 
15 10 15 

Ala Tyr Arg Asp Asn Pro Phe He Glu Ala Leu Pro Pro Leu Gin Glu 
20 25 30 

Ser Val Asn Ser Ala Ala Ser Leu Lys Ser Ser Leu Gin Leu Thr Ser 
35 40 45 

Ser Asp Leu Gin Lys Ser Arg Val He Arg Ala His Thr He Cys Arg 

50 55 60 

He Pro Asp Asp Tyr Phe Gin Pro Leu Gly Thr His Leu Leu Leu Ser 
65 70 75 80 

Glu Arg He Ser Val Met He Arg Gly Gly Tyr Val Gly Arg Asn Pro 
85 90 95 

Lys Thr Gly Asp Leu Gin Lys His Leu Gin Asn Gly Tyr Glu Arg Val 
100 105 110 
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Gin. Thr Gly Glu Leu Glu Thr Phe Arg Phe Glu Glu Ala Arg Ser Thr 
115 120 125 



Ala Gin Ser Leu Leu Leu lie Gly Cys Ser Gly Ser Gly Lys Thr Thr 
130 135 140 

Ser Leu His Arg" lie Leu Ala Thr Tyr Pro Gin Val lie Tyr His Arg 
145 150 155 160 

Glu Leu Asn Val Glu Gin Val Val Tyr Leu Lys lie Asp Cys Ser His 
165 170 175 

Asn Gly Ser Leu Lys Glu lie Cys Leu Asn Phe Phe Arg Ala Leu Asp 
180 185 190 

Arg Ala Leu Gly Ser Asn Tyr Glu Arg Arg Tyr Gly Leu Lys Arg His 
195 200 205 

Gly lie Glu Thr Met Leu Ala Leu Met Ser Gin He Ala Asn Ala His 
210 215 220 

Ala Leu Gly Leu Leu Val He Asp Glu lie Gin His Leu Ser Arg Ser 
225 230 235 240 

Arg Ser Gly Gly Ser Gin Glu Met Leu Asn Phe Phe Val Thr Met Val 

245 250 255 

Asn He He Gly Val Pro Val Met Leu He Gly Thr Pro Lys Ala Arg 
260 265 270 

Glu He Phe Glu Ala Asp Leu Arg Ser Ala Arg Arg Gly Ala Gly Phe 
275 280 285 

Gly Ala He Phe Trp Asp Pro He Gin Gin Thr Gin Arg Gly Lys Pro 
290 295 300 

Asn Gin Glu Trp He Ala Phe Thr Asp Asn Leu Trp Gin Leu Gin Leu 
305 310 315 320 

Leu Gin Arg Lys Asp Ala Leu Leu Ser Asp Glu Val Arg Asp Val Trp 
325 . 330 335 

Tyr Glu Leu Ser Gin Gly Val Met Asp He Val Val Lys Leu Phe Val 
340 34S 350 

Leu Ala Gin Leu Arg Ala Leu Ala Leu Gly Asn Glu Arg He Thr Ala 
355 360 365 

Gly Leu Leu Arg Gin Val Tyr Gin Asp Glu Leu Lys Pro Val His Pro 
370 375 380 

Met Leu Glu Ala Leu Arg Ser Gly He Pro Glu Arg He Ala Arg Tyr 
385 390 395 400 

Ser Asp Leu Val Val Pro Glu He Asp Lys Arg Leu He Gin Leu Gin 
405 410 415 



Leu Asp He Ala Ala He Gin Glu Gin Thr Pro Glu Glu Lys Ala Leu 



420 



425 



430 



Gin Glu Leu Asp Thr Glu Asp Gin Arg His Leu Tyr Leu Met Leu Lys 
435 440 445 

Glu Asp Tyr Asp Ser Ser Leu Leu lie Pro Thr lie Lys Lys Ala Phe 
450 455 460 

Ser Gin Asn Pro Thr Met Thr Arg Gin Lys Leu Leu Pro Leu Val Leu 
465 470 475 480 

Gin Trp Leu Met Glu Gly Glu Thr Val Val Ser Glu Leu Glu Lys Pro 

485 490 495 



Ser Lys Ser Lys Lys Val Ser Ala lie Lys Val Val Lys Pro Ser Asp 

500 505 510 

Trp Asp Ser Leu Pro Asp Thr Asp Leu Arg Tyr lie Tyr Ser Gin Arg 
515 520 525 

Gin Pro Glu Lys Thr Met His Glu Arg Leu Lys Gly Lys Gly Val lie 
530 535 540 



Val Asp Met Ala Ser Leu Phe Lys Gin Ala Gly * 
545 550 555 

(2) INFORMATION FOR SEQ ID NO : 3 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "pEM delta R.adj to 1" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TTTAGAGCAA TTCGGT6TTA GTTTCAGCAA GCAAACATTA ACCATAGCTA ATGATTTATA 60 

GCCATATTAA CCATTGGGGT ACCGAGCTCG AATTCCATGG TCTGTTTCCT GTGTGAAATT 120 

GTTATCCGCT CACAATTCCA CACATTATAC GAGCCGGATG ATTAATTGTC AACAGCTCAT 180 

TTCAGAATAT TTGCCAGAAC CGTTATGATG TCGGCGCAAA AAACATTATC CAGAACGGGA 240 

GTGCGCCTTG AGCGACACGA ATTATGCAGT GATTTACGAC CTGCACAGCC ATACCACAGC 300 

TTCCGATGGC TGCCTGACGC CAGAAGCATT GGTGCACCGT GCAGTCGATG ATAAGCTGTC 360 

AAACCAGATC AATTCGCGCT AACTCACATT AATTGCGTTG CGCTCACTGC CCGCTTTCCA 42 0 

GTCGGGAAAC CTGTCGTGCC AGCTGCATTA ATGAATCGGC CAACGCGCGG GGAGAGGCGG 480 
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TTTGCGTATT GGGCGCCAGG GTGGTTTTTC TTTTCACCAG TGAGACGGGC AACAGCTGAT 54 0 

TGCCCTTCAC CGCCTGGCCC TGAGAGAGTT GCAGCAAGCG GTCCACGCTG GTTTGCCCCA 600 

GCAGGCGAAA ATCCTGTTTG ATGGTGGTTG ACGGCGGGAT ATAACATGAG CTGTCTTCGG 660 

TATCGTCGTA TCCCACTACC GAGATATCCG CACCAACGCG CAGCCCGGAC TCGGTAATGG 720 

CGCGCATTGC GCCCAGCGCC ATCTGATCGT TGGCAACCAG CATCGCAGTG GGAACGATGC 780 

CCTCATTCAG CATTTGCATG GTTTGTTGAA AACCGGACAT GGCACTCCAG TCGCCTTCCC 840 

GTTCCGCTAT CGGCTGAATT TGATTGCGAG TGAGATATTT ATGCCAGCCA GCCAGACGCA 900 

GACGCGCCGA GACAGAACTT AATGGGCCCG CTAACAGCGC GATTTGCTGG TGACCCAATG 960 

CGACCAGATG CTCCACGCCC AGTCGCGTAC CGTCTTCATG GGAGAAAATA ATACTGTTGA 1020 

TGGGTGTCTG GTCAGAGACA TCAAGAAATA ACGCCGGAAC ATTAGTGCAG GCAGCTTCCA 1080 

CAGCAATGGC ATCCTGGTCA TCCAGCGGAT AGTTAATGAT CAGCCCACTG ACGCGTTGCG 1140 

CGAGAAGATT GTGCACCGCC GCTTTACAGG CTTCGACGCC GCTTCGTTCT ACCATCGACA 1200 

CCACCACGCT GGCACCCAGT TGATCGGCGC GAGATTTAAT CGCCGCGACA ATTTGCGACG 1260 

GCGC6TGCAG GGCCAGACTG GAGGTGGCAA CGCCAATCAG CAACGACTGT TTGCCCGCCA 1320 

GTTGTTGTGC CACGCGGTTG GGAATGTAAT TCAGCTCCGC CATCGCCGCT TCCACTTTTT 1380 

CCCGCGTTTT CGCAGAAACG TGGCTGGCCT GGTTCACCAC GCGGGAAACG GTCTGATAAG 1440 

AGACACCGGC ATACTCTGCG ACATCGTATA ACGTTACTGG TTTCACATTC ACCACCCTGA 1500 

ATTGACTCTC TTCCGGGCGC TATCATGCCA TACCGCGAAA GGTTTTGCAC CATTCGATGG 1560 

TGTCAACGTA AATGCATGCC GCTTCGCCTT CGCGCGCGAA TTGATCTGCT GCCTCGCGCG 1620 

TTTCGGTGAT GACGGTGAAA ACCTCTGACA CATGCAGCTC CCGGAGACGG TCACAGCTTG 1680 

TCTGTAAGCG GATGCCGGGA GCAGACAAGC CCGTCAGGGC GCGTCAGCGG GTGTTGGCGG 1740 

GTGTCGGGGC GCAGCCATGA CCCAGTCACG TAGCGATAGC GGAGTGTATA CTGGCTTAAC 1800 

TATGCGGCAT CAGAGCAGAT TGTACTGAGA GTGCACCATA TGCGGTGTGA AATACCGCAC 1860 

AGATGCGTAA GGAGAAAATA CCGCATCAGG CGCTCTTCCG CTTCCTCGCT CACTGACTCG 1920 

CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC GGTAATACGG 1980 

TTATCCACAG AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG CCAGCAAAAG 2 040 

GCCAGGAACC GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC 2100 

GAGCATCACA AAAATCGACG CTCAAGTCAG .AGGTGGCGAA ACCCGACAGG ACTATAAAGA 2160 

TACCAGGCGT TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT 2220 
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ACCGGATACC TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA TAGCTCACGC 2280 
TGTAGGTATC TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC 2340 
CCCGTTCAGC CCGACCGCTG CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA 2400 
AGACACGACT TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT 2460 
GTAGGCGGTG CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA 2520 
GTATTTGGTA TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT 2580 
TGATCCGGCA AACAAACCAC CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA GCAGCAGATT 2 640 
ACGCGCAGAA AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT 2700 
CAGTGGAACG AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC 2 760 
ACCTAGATCC TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA 2820 
ACTTGGTCTG ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA 2880 
TTTCGTTCAT CCATAGTTGC CTGACTCCCC GTCGTGTAGA TAACTACGAT ACGGGAGGGC 2940 
TTACCATCTG GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT 3000 
TTATCAGCAA TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA 3060 
TCCGCCTCCA TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT 3120 
AATAGTTTGC GCAACGTTGT TGCCATTGCT GTAGGCATCG TGGTGTCACG CTCGTCGTTT 3180 
GGTATGGCTT CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG 3240 
TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC 3300 
GCAGTGTTAT CACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC 3360 
GTAAGATGCT TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG 3420 
CGGCGACCGA GTTGCTCTTG CCCGGCGTCA ACACGGGATA ATACCGCGCC ACATAGCAGA 3480 
ACTTTAAAAG TGCTCATCAT TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA 3 540 
CCGCTGTTGA GATCCAGTTC GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT 3600 
TTTACTTTCA CCAGCGTTTC TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG 3660 
GGAATAAGGG CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA 3 720 
AGCATTTATC AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT 3780 
AAACAAAAAG AGTTTGTAGA AACGCAAAAA GGCCATCCGT CAGGATGGCC TTCTGCTTAA 3840 
TTTGATGCCT GGCAGTTTAT GGCGGGCGTC CTGCCCGCCA CCCTCCGGGC CGTTGCTTCG 3 900 
CAACGTTCAA ATCCGCTCCC GGCGGATTTG TCCTACTCAG GAGAGCGTTC ACCGACAAAC 3 960 
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AACAGATAAA ACGAAAGGCC CAGTCTTTCG 
CAGTTCCCTA CTCTCGCATG GGGAGACCCC 

TCTGAGTTCG GCATGGGGTC AGGTGGGACC 
TTTATCAGAC CGCTTCTGCG TTCTGATTTA 
CCGCCAAAAC AGCCAAGCTT GCATGCCTGC 
CCGTCGGACA GCTTTAATAA ACCCTGCACT 
TGGGAACTGG GAGGGGTGGA AATGGAGTTT 
TAGATGGGAA CTGGGTGTAG CGTCGTAAGC 
TTGGAACTAG ATTTCACTTA TCTGGTTGGT 
TTGTGTCTCA AAATCTCTGA TGTTACATTG 
TAAAACTGTC TGCTTACATA AACAGTAATA 
AAACGTCTTG CTCGAGGCCG CGATTAAATT 
AATGGGCTCG CGATAATGTC GGGCAATCAG 
CCGATGCGCC AGAGTTGTTT CTGAAACATG 
ATGAGATGGT CAGACTAAAC TGGCTGACGG 
TTATCCGTAC TCCTGATGAT GCATGGTTAC 
TCCAGGTATT AGAAGAATAT CCTGATTCAG 
TCCTGCGCCG GTTGCATTCG ATTCCTGTTT 
TTCGTCTCGC TCAGGCGCAA TCACGAATGA 
ATGACGAGCG TAATGGCTGG CCTGTTGAAC 
CATTCTCACC GGATTCAGTC GTCACTCATG 
ACGAGGGGAA ATTAATAGGT TGTATTGATG 
AGGATCTTGC CATCCTATGG AACTGCCTCG 
TTTTTCAAAA ATATGGTATT GATAATCCTG 
TCGATGAGTT TTTCTAATCA GAATTGGTTA 
TGACTTGACG GGACGGCGGC TTTGTTGAAT 
GATCACGCAT CTTCCCGACA ACGCAGACCG 
CAACTGGTCC ACCTACAACA AAGCTCTCAT 
TGATGGGGCG ATTCAGGCCT GGTATGAGTC 



ACTGAGCCTT TCGTTTTATT TGATGCCTGG 4020 

ACACTACCAT CGGCGCTACG GCGTTTCACT 4080 

ACCGCGCTAC TGCCGCCAGG CAAATTCTGT 4140 

ATCTGTATCA GGCTGAAAAT CTTCTCTCAT 4200 

AGGTCGACTC TAGAGGATCC CCAAGAAAGT 4260 

TATCTGTTTA GTGTGGGCGG ACAAAATAGT 4320 

TTAAGGATTA TTTAGGGAAG AGTGACAAAA 4380 

TAATACGAAA ATTAAAAATG ACAAAATAGT 4440 

CGACCTGCAG GGGGGGGGGG GAAAGCCACG 4500 

CACAAGATAA AAATATATCA TCATGAACAA 4560 

CAAGGGGTGT TATGAGCCAT ATTCAACGGG 4620 

CCAACATGGA TGCTGATTTA TATGGGTATA 4680 

GTGCGACAAT CTATCGATTG TATGGGAAGC 4740 

GCAAAGGTAG CGTTGCCAAT GATGTTACAG 48 00 

AATTTATGCC TCTTCCGACC ATCAAGCATT 4860 

TCACCACTGC GATCCCCGGG AAAACAGCAT 4920 

GTGAAAATAT TGTTGATGCG CTGGCAGTGT 4 98 0 

GTAATTGTCC TTTTAACAGC GATCGCGTAT 5040 

ATAACGGTTT GGTTGATGCG AGTGATTTTG 5100 

AAGTCTGGAA AGAAATGCAT AAGCTTTTGC 5160 

GTGATTTCTC ACTTGATAAC CTTATTTTTG 5220 

TTGGACGAGT CGGAATCGCA GACCGATACC 5280 

GTGAGTTTTC TCCTTCATTA CAGAAACGGC 5340 

ATATGAATAA ATTGCAGTTT CATTTGATGC 5400 

ATTGGTTGTA ACACTGGCAG AGCATTACGC 54 60 

AAATCGAACT TTTGCTGAGT TGAAGGATCA 552 0 

TTCCGTGGCA AAGCAAAAGT TCAAAATCAC 5580 

CAACCGTGGC TCCCTCACTT TCTGGCTGGA 5640 

AGCAACACCT TCTTCACGAG GCAGACCTCA 5700 
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GCGCCCCCCC CCCCCTGCT^GTCGACCCCA CGCCCCTCTT TAATACGACG GGCAATTTGC 5760 

ACTTCAGAAA ATGAAGAGTT TGCTTTAGCC ATAACAAAAG TCCAGTATGC TTTTTCACAG 5820 

CATAACTGGA CTGATTTCAG TTTACAACTA TTCTGTCTAG TTTAAGACTT TATTGTCATA 5880 

GTTTAGATCT ATTTTGTTCA GTTTAAGACT TTATTGTCCG CCCACA 5926 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5926 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "pEM-delta" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

CAGATCAATT CGCGCTAACT CACATTAATT GCGTTGCGCT CACTGCCCGC TTTCCAGTCG 60 

GGAAACCTGT CGTGCCAGCT GCATTAATGA ATCGGCCAAC GCGCGGGGAG AGGCGGTTTG 12 0 

CGTATTGGGC GCCAGGGTGG TTTTTCTTTT CACCAGTGAG ACGGGCAACA GCTGATTGCC 180 

CTTCACCGCC TGGCCCTGAG AGAGTTGCAG CAAGCGGTCC ACGCTGGTTT GCCCCAGCAG 240 

GCGAAAATCC TGTTTGATGG TGGTTGACGG CGGGATATAA CATGAGCTGT CTTCGGTATC 3 00 

GTCGTATCCC ACTACCGAGA TATCCGCACC AACGCGCAGC CCGGACTCGG TAATGGCGCG 360 

CATTGCGCCC AGCGCCATCT GATCGTTGGC AACCAGCATC GCAGTGGGAA CGATGCCCTC 420 

ATTCAGCATT TGCATGGTTT GTTGAAAACC GGACATGGCA CTCCAGTCGC CTTCCCGTTC 480 

CGCTATCGGC TGAATTTGAT TGCGAGTGAG ATATTTATGC CAGCCAGCCA GACGCAGACG 540 

CGCCGAGACA GAACTTAATG GGCCCGCTAA CAGCGCGATT TGCTGGTGAC CCAATGCGAC 600 

CAGATGCTCC ACGCCCAGTC GCGTACCGTC TTCATGGGAG AAAATAATAC TGTTGATGGG 660 

TGTCTGGTCA GAGACATCAA GAAATAACGC CGGAACATTA GTGCAGGCAG CTTCCACAGC 72 0 

AATGGCATCC TGGTCATCCA GCGGATAGTT AATGATCAGC CCACTGACGC GTTGCGCGAG 7 80 

AAGATTGTGC ACCGCCGCTT TACAGGCTTC GACGCCGCTT CGTTCTACCA TCGACACCAC 840 

CACGCTGGCA CCCAGTTGAT CGGCGCGAGA TTTAATCGCC GCGACAATTT GCGACGGCGC 900 

GTGCAGGGCC AGACTGGAGG TGGCAACGCC AATCAGCAAC GACTGTTTGC CCGCCAGTTG 96 0 

TTGTGCCACG CGGTTGGGAA TGTAATTCAG CTCCGCCATC GCCGCTTCCA CTTTTTCCCG 1020 
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CGTTTTCGCA GAAACGTGGC 


TGGCCTGGTT 


CACCACGCGG 


GAAACGGTCT 


GATAAGAGAC 


1080 


ACCGGCATAC 


TCTGCGACAT 


CGTATAACGT 


TACTGGTTTC 


ACATTCACCA 


CCCTGAATTG 


1140 


ACTCTCTTCC 


GGGCGCTATC 


ATGCCATACC 


GCGAAAGGTT 


TTGCACCATT 


CGATGGTGTC 


1200 


AACGTAAATG 


CATGCCGCTT 


CGCCTTCGCG 


CGCGAATTGA 


TCTGCTGCCT 


CGCGCGTTTC 


1260 


GGTGATGACG 


GTGAAAACCT 


CTGACACATG 


CAGCTCCCGG 


AGACGGTCAC 


AGCTTGTCTG 


1320 


TAAGCGGATG 


CCGGGAGCAG 


ACAAGCCCGT 


CAGGGCGCGT 


CAGCGGGTGT 


TGGCGGGTGT 


1380 


CGGGGCGCAG 


CCATGACCCA 


GTCACGTAGC 


GATAGCGGAG 


TGTATACTGG 


CTTAACTATG 


1440 


CGGCATCAGA 


GCAGATTGTA 


CTGAGAGTGC 


ACCATATGCG 


GTGTGAAATA 


CCGCACAGAT 


1500 


GCGTAAGGAG 


AAAATACCGC 


ATCAGGCGCT 


CTTCCGCTTC 


CTCGCTCACT 


GACTCGCTGC 


1560 


GCTCGGTCGT 


TCGGCTGCGG 


CGAGCGGTAT 


CAGCTCACTC 


AAAGGCGGTA 


ATACGGTTAT 


1620 


CCACAGAATC 


AGGGGATAAC 


GCAGGAAAGA 


ACATGTGAGC 


AAAAGGCCAG 


CAAAAGGCCA 


1680 


GGAACCGTAA 


AAAGGCCGCG 


TTGCTGGCGT 


TTTTCCATAG 


GCTCCGCCCC 


CCTGACGAGC 


1740 


ATCACAAAAA 


TCGACGCTCA 


AGTCAGAGGT 


GGCGAAACCC 


GACAGGACTA 


TAAAGATACC 


1800 


AGGCGTTTCC 


CCCTGGAAGC 


TCCCTCGTGC 


GCTCTCCTGT 


TCCGACCCTG 


CCGCTTACCG 


1860 


GATACCTGTC 


CGCCTTTCTC 


CCTTCGGGAA 


GCGTGGCGCT 


TTCTCATAGC 


TCACGCTGTA 


1920 


GGTATCTCAG 


TTCGGTGTAG 


GTCGTTCGCT 


CCAAGCTGGG 


CTGTGTGCAC 


GAACCCCCCG 


1980 


TTCAGCCCGA 


CCGCTGCGCC 


TTATCCGGTA 


ACTATCGTCT 


TGAGTCCAAC 


CCGGTAAGAC 


2040 


ACGACTTATC 


GCCACTGGCA 


GCAGCCACTG 


GTAACAGGAT 


TAGCAGAGCG 


AGGTATGTAG 


2100 


GCGGTGCTAC 


AGAGTTCTTG 


AAGTGGTGGC 


CTAACTACGG 


CTACACTAGA 


AGGACAGTAT 


2160 


TTGGTATCTG 


CGCTCTGCTG 


AAGCCAGTTA 


CCTTCGGAAA 


AAGAGTTGGT 


AGCTCTTGAT 


2220 


CCGGCAAACA 


AACCACCGCT 


GGTAGCGGTG 


GTTTTTTTGT 


TTGCAAGCAG 


CAGATTACGC 


2280 


6CAGAAAAAA 


AGGATCTCAA 


GAAGATCCTT 


TGATCTTTTC 


TACGGGGTCT 


GACGCTCAGT 


2340 


GGAACGAAAA 


CTCACGTTAA 


GGGATTTTGG 


TCATGAGATT 


ATCAAAAAGG 


ATCTTCACCT 


2400 


AGATCCTTTT 


AAATTAAAAA 


TGAAGTTTTA 


AATCAATCTA 


AAGTATATAT 


GAGTAAACTT 


2460 


GGTCTGACAG 


TTACCAATGC 


TTAATCAGTG 


AGGCACCTAT 


CTCAGCGATC 


TGTCTATTTC 


2520 


GTTCATCCAT 


AGTTGCCTGA 


CTCCCCGTCG 


TGTAGATAAC 


TACGATACGG 


GAGGGCTTAC 


2580 


CATCTGGCCC 


CAGTGCTGCA 


ATGATACCGC 


GAGACCCACG 


CTCACCGGCT 


CCAGATTTAT 


2640 


CAGCAATAAA 


CCAGCCAGCC 


GGAAGGGCCG 


■ AGCGCAGAAG 


TGGTCCTGCA 


ACTTTATCCG 


2700 


CCTCCATCCA 


. GTCTATTAAT 


TGTTGCCGGG 


AAGCTAGAGT 


AAGTAGTTCG 


CCAGTTAATA 


2760 
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GTTTGCGCAA CGTTGTTGCC ATTGCTGTAG GCATCGTGGT GTCACGCTCG TCGTTTGGTA 2820 
TGGCTTCATT CAGCTCCGGT TCCCAACGAT CAAGGCGAGT TACATGATCC CCCATGTTGT 2880 
GCAAAAAAGC GGTTAGCTCC TTCGGTCCTC CGATCGTTGT CAGAAGTAAG TTGGCCGCAG 2940 
TGTTATCACT CATGGTTATG GCAGCACTGC ATAATTCTCT TACTGTCATG CCATCCGTAA 3000 
GATGCTTTTC TGTGACTGGT GAGTACTCAA CCAAGTCATT CTGAGAATAG TGTATGCGGC 3060 
GACCGAGTTG CTCTTGCCCG GCGTCAACAC GGGATAATAC CGCGCCACAT AGCAGAACTT 3120 
TAAAAGTGCT CATCATTGGA AAACGTTCTT CGGGGCGAAA ACTCTCAAGG ATCTTACCGC 318 0 
TGTTGAGATC CAGTTCGATG TAACCCACTC GTGCACCCAA CTGATCTTCA GCATCTTTTA 3240 
CTTTCACCAG CGTTTCTGGG TGAGCAAAAA CAGGAAGGCA AAATGCCGCA AAAA^.GGGAA 33 0 0 
TAAGGGCGAC ACGGAAATGT TGAATACTCA TACTCTTCCT TTTTCAATAT TATTGAAGCA 3360 
TTTATCAGGG TTATTGTCTC ATGAGCGGAT ACATATTTGA ATGTATTTAG AAAAATAAAC 3420 
AAAAAGAGTT TGTAGAAACG CAAAAAGGCC ATCCGTCAGG ATGGCCTTCT GCTTAATTTG 348 0 
ATGCCTGGCA GTTTATGGCG GGCGTCCTGC CCGCCACCCT CCGGGCCGTT GCTTCGCAAC 3540 
GTTCAAATCC GCTCCCGGCG GATTTGTCCT ACTCAGGAGA GCGTTCACCG ACAAACAACA 3600 
GATAAAACGA AAGGCCCAGT CTTTCGACTG AGCCTTTCGT TTTATTTGAT GCCTGGCAGT 366 0 
TCCCTACTCT CGCATGGGGA GACCCCACAC TACCATCGGC GCTACGGCGT TTCACTTCTG 372 0 
AGTTCGGCAT GGGGTCAGGT GGGACCACCG CGCTACTGCC GCCAGGCAAA TTCTGTTTTA 3780 
TCAGACCGCT TCTGCGTTCT GATTTAATCT GTATCAGGCT 6AAAATCTTC TCTCATCCGC 3 840 
CAAAACAGCC AAGCTTGCAT GCCTGCAGGT CGACTCTAGA GGATCCCCAA GAAAGTCCGT 3900 
CGGACAGCTT TAATAAACCC TGCACTTATC TGTTTAGTGT GGGCGGACAA AATAGTTGGG 3 96 0 
AACTGGGAGG GGTGGAAATG GAGTTTTTAA GGATTATTTA GGGAAGAGTG ACAAAATAGA 4020 
TGGGAACTGG GTGTAGCGTC GTAAGCTAAT ACGAAAATTA AAAATGACAA AATAGTTTGG 4080 
AACTAGATTT CACTTATCTG GTTGGTCGAC CTGCAGGGGG GGGGGGGAAA GCCACGTTGT 4140 
GTCTCAAAAT CTCTGATGTT ACATTGCACA AGATAAAAAT ATATCATCAT GAACAATAAA 4200 
ACTGTCTGCT TACATAAACA GTAATACAAG GGGTGTTATG AGCCATATTC AACGGGAAAC 4260 
GTCTTGCTCG AGGCCGCGAT TAAATTCCAA CATGGATGCT GATTTATATG GGTATAAATG 4320 
GGCTCGCGAT AATGTCGGGC AATCAGGTGC GACAATCTAT CGATTGTATG GGAAGCCCGA 4380 
TGCGCCAGAG TTGTTTCTGA AACATGGCAA AGGTAGCGTT GCCAATGATG TTACAGATGA 4440 
GATGGTCAGA CTAAACTGGC TGACGGAATT TATGCCTCTT CCGACCATCA AGCATTTTAT 4500 
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CCGTACTCCT GATGATGCAT GGTTACTCAC CACTGCGATC CCCGGGAAAA CAGCATTCCA 4560 

GGTATTAGAA GAATATCCTG ATTCAGGTGA AAATATTGTT GATGCGCTGG CAGTGTTCCT 4620 

GCGCCGGTTG CATTCGATTC CTGTTTGTAA TTGTCCTTTT AACAGCGATC GCGTATTTCG 4680 

TCTCGCTCAG GCGCAATCAC GAATGAATAA CGGTTTGGTT GATGCGAGTG ATTTTGATGA 4740 

CGAGCGTAAT GGCTGGCCTG TTGAACAAGT CTGGAAAGAA ATGCATAAGC TTTTGCCATT 4800 

CTCACCGGAT TCAGTCGTCA CTCATGGTGA TTTCTCACTT GATAACCTTA TTTTTGACGA 4860 

GGGGAAATTA ATAGGTTGTA TTGATGTTGG ACGAGTCGGA ATCGCAGACC GATACCAGGA 4920 

TCTTGCCATC CTATGGAACT GCCTCGGTGA GTTTTCTCCT TCATTACAGA AACGGCTTTT 4980 

TCAAAAATAT GGTATTGATA ATCCTGATAT GAATAAATTG CAGTTTCATT TGATGCTCGA 504 0 

TGAGTTTTTC TAATCAGAAT TGGTTAATTG GTTGTAACAC TGGCAGAGCA TTACGCTGAC 5100 

TTGACGGGAC GGCGGCTTTG TTGAATAAAT CGAACTTTTG CTGAGTTGAA GGATCAGATC 5160 

ACGCATCTTC CCGACAACGC AGACCGTTCC GTGGCAAAGC AAAAGTTCAA AATCACCAAC 5220 

TGGTCCACCT ACAACAAAGC TCTCATCAAC CGTGGCTCCC TCACTTTCTG GCTGGATGAT 5280 

GGGGCGATTC AGGCCTGGTA TGAGTCAGCA ACACCTTCTT CACGAGGCAG ACCTCAGCGC 5340 

CeCCCCCCCC CTGCAGGTCG ACCCCACGCC CCTCTTTAAT ACGACGGGCA ATTTGCACTT 54 00 

CAGAAAATGA AGAGTTTGCT TTAGCCATAA CAAAAGTCCA GTATGCTTTT TCACAGCATA 5460 

ACTGGACTGA TTTCAGTTTA CAACTATTCT GTCTAGTTTA AGACTTTATT GTCATAGTTT 5520 

AGATCTATTT TGTTCAGTTT AAGACTTTAT TGTCCGCCCA CATTTAGAGC AATTCGGTGT 5580 

TAGTTTCAGC TU^GCAAACAT TAACCATAGC TAATGATTTA TAGCCATATT AACCATTGGG 5640 

GTACCGAGCT CGAATTCCAT GGTCTGTTTC CTGTGTGAAA TTGTTATCCG CTCACAATTC 5700 

CACACATTAT ACGAGCCGGA TGATTAATTG TCAACAGCTC ATTTCAGAAT ATTTGCCAGA 5760 

ACCGTTATGA TGTCGGCGCA AAAAACATTA TCCAGAACGG GAGTGCGCCT TGAGCGACAC 5820 

GAATTATGCA GTGATTTACG ACCTGCACAG CCATACCACA GCTTCCGATG GCTGCCTGAC 5880 

GCCAGAAGCA TTGGTGCACC GTGCAGTCGA TGATAAGCTG TCAAAC 5926 
(2) INFORMATION FOR SEQ ID NO: 5: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "pER183 (target plasmid) " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

GAATTCCGGA TGAGCATTCA TCAGGCGGGC AAGAATGTGA ATAAAGGCCG GATAAAACTT 60 

GTGCTTATTT TTCTTTACGG TCTTTAAAAA GGCCGTAATA TCCAGCTGAA CGGTCTGGTT 120 

ATAGGTACAT TGAGCAACTG ACTGAAATGC CTCAAAATGT TCTTTACGAT GCCATTGGGA 180 

TATATCAACG GTGGTATATC CAGTGATTTT TTTCTCCATT TTAGCTTCCT TAGCTCCTGA 240 

AAATCTCGAT AACTCAAAAA ATACGCCCGG TAGTGATCTT ATTTCATTAT GGTGAAAGTT 300 

GGAACCTCTT ACGTGCCGAT CAACGTCTCA TTTTCGCCAA AAGTTGGCCC AGGGCTTCCC 3 60 

GGTATCAACA GGGACACCAG GATTTATTTA TTCTGCGAAG TGATCTTCCG TCACAGGTAT 420 

TTATTCGGCG CAAAGTGCX3T CGGGTGATGC TGCCAACTTA CTGATTTAGT GTATGATGGT 480 

GTTTTTGAGG TGCTCCAGTG GCTTCTGTTT CTATCAGCTG TCCCTCCTGT TCAGCTACTG 540 

ACGGGGTGGT GCGTAACGGC AAAAGCACCG CCGGACATCA GCGCTAGCGG AGTGTATACT GOO 

GGCTTACTAT GTTGGCACTG ATGAGGGTGT CAGTGAAGTG CTTCATGTGG CAGGAGAAAA 660 

AAGGCTGCAC CGGTGCGTCA GCAGAATATG TGATACAGGA TATATTCCGC TTCCTCGCTC 720 

ACTGACTCGC TACGCTCGGT CGTTCGACTG CGGCGAGCGG AAATGGCTTA CGAACGGGGC 78 0 

GGAGATTTCC TGGAAGATGC CAGGAAGATA CTTAACAGGG AAGTGAGAGG GCCGCGGCAA 84 0 

AGCCGTTTTT CCATAGGCTC CGCCCCCCTG ACAAGCATCA CGAAATCTGA CGCTCAAATC 900 

AGTGGTGGCG AAACCCGACA GGACTATAAA GATACCAGGC GTTTCCCCTG GCGGCTCCCT 960 

CGTGCGCTCT CCTGTTCCTG CCTTTCGGTT TACCGGTGTC ATTCCGCTGT TATGGCCGCG 102 0 

TTTGTCTCAT TCCACGCCTG ACACTCAGTT CCGGGTAGGC AGTTCGCTCC AAGCTGGACT 1080 

GTATGCACGA ACCCCCCGTT CAGTCCGACC GCTGCGCCTT ATCCGGTAAC TATCGTCTTG 1140 

AGTCCAACCC GGAAAGACAT GCAAAAGCAC CACTGGCAGC AGCCACTGGT AATTGATTTA 1200 

GAGGAGTTAG TCTTGAAGTC ATGCGCCGGT TAAGGCTAAA CTGAAAGGAC AAGTTTTGGT 1260 

GACTGCGCTC CTCCAAGCCA GTTACCTCGG TTCAAAGAGT TGGTAGCTCA GAGAACCTTC 132 0 

GAAAAACCGC CCTGCAAGGC GGTTTTTTCG TTTTCAGAGC AAGAGATTAC GCGCAGACCA 1380 

AAACGATCTC AAGAAGATCA TCTTATTAAT CAGATAAAAT ATTTCTAGAT TTCAGTGCAA 1440 

TTTATCTCTT CAAATGTAGC ACCTGAAGTC AGCCCCATAC GATATAAGTT GTAATTCTCA 1500 

TGTTTGACAG CTTATCATCG GATGGATCTG AAATTGTAAA CGTTAATATT TTGTTAAATT 1560 
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CGCGTTAAAT TTTTGTTAAA TCAGCTCATT TTTTAACCAA TAGGCCGAAA TCGGCAAAAT 1620 

CCCTTATAAA TCAAAAGAAT AGACCGAGAT AGGGTTGAGT GTTGTTCCAG TTTGGAACAA 1680 

GAGTCCACTA TTAAAGAACG TGGACTCCAA CGTCAAAGGG CGAAAAACCG TCTATCAGGG 1740 

CGATGGCCCA CTACGTGAAC CATCACCCTA ATCAAGTTTT TTGGGGTCGA GGTGCCGTAA 1800 

AGCACTAAAT CGGAACCCTA AAGGGAGCCC CCGATTTAGA GCTTGACGGG GAAAGCCGGC 1860 

GAACGTGGCG AGAAAGGAAG GGAAGAAAGC GAAAGGAGCG GGCGCTAGGG CGCTGGCAAG 1920 

TGTAGCGGTC ACGCTGCGCG TAACCACCAC ACCCGCCGCG CTTAATGCGC CGCTACAGGG 198 0 

CGCGTCAGAT CCCATCGATA AGCTTTAATG CGGTAGTTTA TCACAGTTAA ATTGCTAACG 2040 

CAGTCAGGCA CCGTGTATGA AATCTAACAA TGCGCTCATC GTCATCCTCG GCACCGTCAC 2100 

CCTGGATGCT GTAGGCATAG GCTTGGTTAT GCCGGTACTG CCGGGCCTCT TGCGGGATAT 2160 

CGTCCATTCC GACAGCATCG CCAGTCACTA TGGCGTGCTG CTAGCGCTAT ATGCGTTGAT 2220 

GCAATTTCTA TGCGCACCCG TTCTCGGAGC ACTGTCCGAC CGCTTTGGCC GCCGCCCAGT 2280 

CCTGCTCGCT TCGCTACTTG GAGCCACTAT CGACTACGCG ATCATGGCGA CCACACCCGT 2340 

CCTGTGGATC CGCTGGCGAA AGGGGGATGT GCTGCAAGGC GATTAAGTTG GGTAACGCCA 2400 

GGGTTTTCCC AGTCACGACG TTGTAAAACG ACGGCCAGTG AATTGCGGCC GCCCTGCAAG 2460 

GAAGGGAATG TCGCCAACAG CGAAGAGAGT TGGGCAACGG ATGTGCTGGT GGAGGTGATC 2520 

GCCTCCTGAT GATGAGCCGC TCCCGATGTG GTGTCGGGAG CGGTATTTTC TATAAAACTT 25 8 0 

ACCGCTTATT TGAGATATTC ATCGAAAATG TCGAGTAATT CTTGATGTAT ACACGGCCAT 2640 

TCCTGACCTA AATTGACGGT ACACAAGCCA ATATCGAAGC CATTAATTTT ATAACGATGT 2700 

TTCACTGCGG TATCTACGTG GGGATATATT AATAACCCCC CTATGTTTTC GCCATTTTCA 2760 

GGCTTTAACG ACCATAAGTA ATTCATCAGT TGATAAAGAT TTTGCGAATG AAATTTTTCT 2820 

GTTCCCATTC GTCGTGAAAA AATGCTCTTA TAGTATTTGG CGTCAACGAT AAGTATTTTT 2880 

TCTGATGAGC GAATGGTGAT GTCAGTTTCC ATTCGAGGTA ACAAATTAAG TGACTGATCC 2940 

GATATACTCG ATGCATCCCA TTTTAAATAA GAGCGGGTTG TGTTTGCAGA CGTTAATTCA 3 000 

CGACX3GCAAA ATTCATAAAG AAACTTTTGA TAAAGTAATG ACATCTCTTT TTCGTTTCTT 3060 

TCAAAATCAT AGAAACGGTA GTGTCCTTTG TTTTGACCTG GAATAGAATT ATTGACGATG 3120 

AATTTGCAGA CACTGATAAC GAATTTATAA TAACGCGTAT TTTTTCCGCC ATTCAGATAG 3180 

CTGAAATGCT GCGGAGTTAA ATGAAGAGTG CTAATGCCCG GTAATTTTCT ATAAAGTGAA 324 0 

CGAGCTTCAT CTCTGATAGT TGAATTTAAC TTTTCATGCT TAATTAATAT GGCTAATGTG 3 3 00 
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CTTTTTATAA TTCGGTTAGC 


CAGCGTGTCT 


TCATTAAGCA 


TATCAAAAGT 


ACTGACGGTT 


3360 


TTCCCATGAT 


TAAGATGGAA 


GCCGCGTATT 


GTTTTAGCAA 


ACTCTATTCG 


CCCTTTGATG 


3420 


CCAGGAATGA TCTCGGTGTT AGGATTGTAA TCAAGCTCAA GCCCTCGGCG 


TGAAAGCTGT 


3480 


AAAACCCCTT 


TATTTAATAC 


ATACCCCAGG 


ATATCAAGAA GATTGTTACC 


GGGTATGGCT 


3540 


TCAAGGTTTG 


CCTGCTTAAT 


TTCCTGTAAA 


TAACCCCATG 


CATAGGTAAG 


CATGTAATAG 


3600 


ATATTACGGA 


CAGGTATCAC 


GGGCTGTTCC 


ACTATGAGTC 


CCCTAATAAT 


TTGTTGGTCC 


3660 


ATTTCTGTTG 


TTTATAGGGG 


TCATCAAAGA 


AATATTCTTC 


GAGTAAAGGG 


GCGATATCCG 


3720 


TCATCACAAT 


TTCATTAAGC 


CATTGCGTAT 


CCGGAGAGGT 


GCCATCTTCC 


AACCCACAGC 


3780 


AGAAGTAACT 


ATGCCCAATG 


CGGAATCCTT 


TCCCAAGGAT 


AGTGGCCTCT 


TTGCTGATTT 


3840 


CCTGGTTCAA 


CTCGTTCATT 


TTTTGGCATA 


AAGACTCAAC 


AAATGAAGGT 


TCTGCTTTTT 


3900 


TATTCAGTAA 


AAAATTCCGG 


AACTGTGGTG 


TATCAAAACC 


TGGCTCAATA 


TCTATGAAAG 


3960 


AAAATCGTCT 


GCGTAGGGCA 


TAGTCAACAA 


CGGCCAGAGA 


GCGATCGGCA 


GTATTCATTA 


4020 


AACCGATGAT 


ATAAACATTC 


TCCGGGACAT 


AGAATCGTTC 


TTCATCGTTT 


TCGGAGTAGG 


4080 


TTAGGGGAAC AGACCAGTTT 


TCACCTCGTT 


TATCATGTTC 


CATTAACATC 


ATCACTTCGC 


4140 


CAAATACTTT 


ACTGAGATTG 


GCACGATTGA 


TTTCATCTAT 


AATAAAAATA 


TACTTTTTCT 


4200 


CTGGCTGCTC 


TTTAGCTTGC 


TGACAAAAAT 


TGTAAAATAT 


GCCGTCTTTA 


CGTCGGAAGC 


4260 


CGACGCCATT 


CGGACGATAG 


CCCTGTATAA 


AATCCTCATA 


GCTATAAGAT 


TGATGGAACT 


4320 


GAACCATATT 


GACGCGTTGC 


GGAGCCTTTT 


CTCCTGTCAG 


CAAGTAAGCC 


AGACGGCGTG 


4380 


CAACAAAGGT 


TTTTCCAACG 


CCGGGCGGCC 


CCTGGAGGAT 


AATATTTTTT 


TTGATGGTTA 


4440 


ATCGTTTGAG 


TATCGTCTCT 


ATTGTGGTTT 


CAGGGATAAA 


CAAATCATTT 


AACGCATCTT 


4500 


CCAGACAGTA 


TGATTCAGTT 


TTTGACATAG 


GTGGAATAAC 


ACTCTTGCCA 


GAATTAAATA 


4560 


TTAATTTATA 


GTCGTTGATT 


ATGTTGTCCA 


GCATAGAGGC 


AAATCGGGTG 


TAATCAATAC 


4620 


CCTGTGAGAC 


TTTTTGGGAA 


CAGGCGTAAT 


AGGACTGTCC 


GTATTTTTTA 


GGATATACAC 


4680 


CCGAAGTTGC 


CTGAAAATAC 


TCTGCGATTG 


TTTTAGGTAT 


GTCTGAAGAG 


AACTGCCATT 


4740 


GGGCATGTGG 


TTCATTCGTG 


TCGCTTATAC 


CATAAGCCAA 


AACCAACTCA 


TCAAAATCTT 


4800 


TATAATAGAG 


AATAACGGGA 


TATATACCGT 


TAGAAGCTTC 


CTGACCTTCT 


CCAAGAAATG 


4860 


CAAACCAGGG 


AATAGACGTA 


AAATTACCAT 


AACCGAAACT 


CAATTTTACT 


CGCAGGTTAC 


4920 


GGTAAGACGT 


TGGATAATCT 


TTAGTGGATT 


GCGAACGTTG 


TTGCTGTGCT 


TGCTTAATAA 


4980 


ATTTTTCAAT 


CCAGGGTTGA 


ATAGATTCCA 


TAAGATATGC 


CTTCCTCATT 


GCTAAGCCTC 


5040 
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TATTATCGCT TTCGCAACGT ACTGAAACAA 
ATATTTACTG AGGGGGA?VAG TTTCTATTGA 

GAGTAAACGC TGTTACGCGA CTTTCTGTTT 
CTGCTTTTGT TTACGCGCCG ACAGTTTGCA 
CTCCTCGGCG GCGGGTGGTT GGGCGGTGAG 
TACCGCCGTG CCAGTGGCAA AACCGGCGGC 
GGGAATACGG CTGTAATCCG GGTAGCGACT 
TGCCGGGGAG ACTTCTGCTT CGAACGGTTG 
TATTCTCCAG GAATAGTGAT TGTGATTAGC 
GCGCTAAATA CCTGTATATA TCATCAGTAA 
AAAATGGGCA ATTTCTGGAA TGATTTAAAT 
TAATAAAAAG CGCACCGGAA AGGTGCGCCA 
GCTTTTTTAC CCCCGCTAGC TGCGCGTTCA 
CT6TTTTCTC CGCTGATCAA ATCCGGGTTT 
CGGAACGCTT TTGCCAGGAT GGATTGCGTC 
ACCTGTTTTT CTATGGTGTC GGCGTAGGCG 
GCTTGTTCTT TTACTGGAGG TAATAAAACA 
CCTTTTTGAC CAGAAGTTGT TTTCACGCAG 
GAAAAAAATA TTTCGATATA TTCTGGTAAA 
TTATCAGGAT ATAGCAAATT TTGATGTTGT 
AATTCTAAAC TTCCX3TTATA GCGAGTAAAT 
TTTAGTTCAC TTTCTGAACA TTCTAGAGTC 
ATGGTCATAG CTGTTTCCTG TGTGAAATTG 
AGCCGGAAGC ATAAAGTGTA AAGCCTGGGG 
TGCGTTGCGC TCACTGCCCG CTTTCCAGTC 
CGCCGGACGC ATCGTGGCCG GCATCACCGG 
CGCCGACATC ACCGATGGGG AAGATCGGGC 
CGGCGTGGGT ATGGTGGCAG GCCCCGTGGC 
TGCACCATTC CTTGCGGCGG CGGTGCTCAA 



TAGATTTTTA CTGCAAAATC AGACTGGTAA 5100 

GTCAGTGGAA GGCTCCCGGT GGTTAACCGG 5160 

ACCGGCAATC ACTCCAATAA ACGCCTGCAC 5220 

CACCTGGCGT AGCGACTGCA TCAGTTCGCT 5280 

GACAATACAG CCTTCCATCA CTTTGACATC 5340 

TTCCAGCCAC TGACCTTTCA GGGTGATGGC 5400 

CGCATAACCG ACGGTGACAT GACGGTTATT 5460 

TGCAATAGAA TGCGTGTCAG TCATAACTGC .5520 

GATGCGGGTG TGTTGGCGCA CATCCGCACC 558 0 

ATATGGGGAA AGTCCAGCTA AAAATAGAAT 5640 

ATATTTATGT GGGTTATGAT TGGCGTGAAA 5700 

GAAAATAATG TTCAGGATTT TTTACGTGAG 5760 

GCTTTGATTT TTTCCAGCAA CGCGGCGGCG 5820 

TCGGCCCGCC ACTGGGCGGT AAGTTCACCA 5880 

AGGTTGTTGA CGCGGGCTAA GGCGTTGTTG 5940 

AAGAGTTGCT CGACGCGGCG AACGATTTCG 6000 

ACTTGGGATT TGATATCTTT TCCTGAAATA 6 060 

TTCATCATTG CATTTCGTGC TGAGGGGGAT 6120 

GCATCTTTGG TTAATCGAGC TCGAATAAGT 6180 

AATTTTTTCA ATAACCCACA AACACCAACA 624 0 

AAAAGATCTC CATCTTGTAA TTTGTGGCGG 6300 

GACCTGCAGG CATGCAAGCT TGGCGTAATC 6360 

TTATCCGCTC ACAATTCCAC ACAACATACG 6420 

TGCCTAATGA GTGAGCTAAC TCACATTAAT 6480 

GGGAAACCTG TCGTGCCAGC GGATCCTCTA 6540 

CGCCACAGGT GCGGTTGCTG GCGCCTATAT 6600 

TCGCCACTTC GGGCTCATGA GCGCTTGTTT 6660 

CGGGGGACTG TTGGGCGCCA TCTCCTTGCA 672 0 

CGGCCTCAAC CTACTACTGG GCTGCTTCCT 6780 
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AATGCAGGAG TCGCATAAGG . GAGAGCGTCG ACCGATGCCC TTGAGAGCCT TCAACCCAGT 6840 

CAGCTCCTTC CGGTGGGCGC GGGGCATGAC TATCGTCGCC GCACTTATGA CTGTCTTCTT 6900 

TATCATGCAA CTCGTAGGAC AGGTGCCGGC AGCGCTCTGG GTCATTTTCG GCGAGGACCG 6960 

CTTTCGCTGG AGCGCGACGA TGATCGGCCT GTCGCTTGCG GTATTCGGAA TCTTGCACGC 7020 

CCTCGCTCAA GCCTTCGTCA CTGGTCCCGC CACCAAACGT TTCGGCGAGA AGCAGGCCAT 708 0 

TATCGCCGGC ATGGCGGCCG ACGCGCTGGG CTACGTCTTG CTGGCGTTCG CGACGCGAGG 7140 

CTGGATGGCC TTCCCCATTA TGATTCTTCT CGCTTCCGGC GGCATCGGGA TGCCCGCGTT 7200 

GCAGGCCATG CTGTCCAGGC AGGTAGATGA CGACCATCAG GGACAGCTTC AAGGATCGCT 7260 

CGCGGCTCTT ACCAGCCTAA CTTCGATCAT TGGACCGCTG ATCGTCACGG CGATTTATGC 7320 

CGCCTCGGCG AGCACATGGA ACGGGTTGGC ATGGATTGTA GGCGCCGCCC TATACCTTGT 7380 

CTGCCTCCCC GCGTTGCGTC GCGGTGCATG GAGCCGGGCC ACCTCGACCT GAATGGAAGC 7440 

CGGCGGCACC TCGCTAACGG ATTCACCACT CCAAGAATTG GAGCCAATCA ATTCTTGCGG 750 0 

AGAACTGTGA ATGCGCAAAC CAACCCTTGG CAGAACATAT CCATCGCGTC CGCCATCTCC 7560 

AGCAGCCGCA CGCGGCGCAT CTCGGGCAGC GTTGGGTCCT GGCCACGGGT GCGCATGATC 7620 

GTGCTCCTGT CGTTGAGGAC CCGGCTAGGC TGGCGGGGTT GCCTTACTGG TTAGCAGAAT 7680 

GAATCACCGA TACGCGAGCG AACGTGAAGC GACTGCTGCT GCAAAACGTC TGCGACCTGA 7740 

GCAACAACAT GAATGGTCTT CGGTTTCCGT GTTTCGTAAA GTCTGGAAAC GCGGAAGTCC 7 8 00 

CCTACGTGCT GCTGAAGTTG CCCGCAACAG A6AGTGGAAC CAACCGGTGA TACCACGATA 7860 

CTATGACTGA GAGTCAACGC CATGAGCGGC CTCATTTCTT ATTCTGAGTT ACAACAGTCC 7920 

GCACCGCTGC CGGTAGCTCC TTCCGGTGGG CGCGGGGCAT GACTATCGTC GCCGCACTTA 7980 

TGACTGTCTT CTTTATCATG CAACTCGTAG GACAGGTGCC GGCAGCGCCC AACAGTCCCC 8040 

CGGCCACGGG GCCTGCCACC ATACCCACGC CGAAACAAGC GCCCTGCACC ATTATGTTCC 8100 

GGATCTGCAT CGCAGGATGC TGCTGGCTAC CCTGTGGAAC ACCTACATCT GTATTAACGA 8160 

AGCGCTAACC GTTTTTATCA GGCTCTGGGA GGCAGAATAA ATGATCATAT CGTCAATTAT 8220 

TACCTCCACG GGGAGAGCCT GAGCAAACTG GCCTCAGGCA TTTGAGAAGC ACACGGTCAC 82 80 

ACTGCTTCCG GTAGTCAATA AACCGGTAAA CCAGCAATAG ACATAAGCGG CTATTTAACG 8340 

ACCCTGCCCT GAACCGACGA CCGGGTCGAA TTTGCTTTCG AATTTCTGCC ATTCATCCGC 8400 

TTATTATCAC TTATTCAGGC GTAGCAACCA GGCGTTTAAG GGCACCAATA ACTGCCTTAA 8460 

AAAAATTACG CCCCGCCCTG CCACTCATCG CAGTACTGTT GTAATTCATT AAGCATTCTG 8520 
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CCGACATGGA AGCCATCACA GACGGCATGA TGAACCTGAA TCGCCAGCGG CATCAGCACC 8580 

TTGTCGCCTT GCGTATAATA TTTGCCCATG GTGAAAACGG GGGCGAAGAA GTTGTCCATA 8640 

TTGGCCACGT TTAAATCAAA ACTGGTGAAA CTCACCCAGG GATTGGCTGA GACGAAAAAC 8700 

ATATTCTCAA TA2VACCCTTT AGGGAAATAG GCCAGGTTTT CACCGTAACA CGCCACATCT 8760 

TGCGAATATA TGTGTAGAAA CTGCCGGAAA TCGTCGTGGT ATTCACTCCA GAGCGATGAA 882 0 

AACGTTTCAG TTTGCTCATG GAAAACGGTG TAACAAGGGT GAACACTATC CCATATCACC 8880 

AGCTCACCGT CTTTCATTGC CATACG 8906 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3190 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOIiOGY: circular 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "pRM2 (target plasmid) " 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

GCGCCCAATA CGCAAACCGC CTCTCCCCGC GCGTTGGCCG ATTCATTAAT GCAGCTGGCA 60 

CGACAGGTTT CCCGACTGGA AAGCGGGCAG TGAGCGCAAC GCAATTAATG TGAGTTAGCT 12 0 

CACTCATTAG GCACCCCAGG CTTTACACTT TATGCTTCCG GCTCGTATGT TGTGTGGAAT 180 

TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGCTATGAC CATGATTACG AATTCGAGCT 240 

CGGTACCCGG GGATCCTCTA GAGTCGAGAT GCCGCATGTG GAAGAGGTGA TTGCACCGAT 300 

CTTCTACACC GTTCCGCTGC AGCTGCTGGC TTACCATGTC GCGCTGATCA AAGGCACCGA 360 

CGTTGACCAG CCGCGTAACC TGGCAAAATC GGTTACGGTT GAGTAATAAA TGGATGCCCT 420 

GCGTAAGCGG GGCATTTTTC TTCCTGTTAT GTTTTTAATC AAACATCCTG CCAACTCCAT 480 

GTGACAAACC GTCATCTTCG GCTACTTTTT CTCTGTCACA GAATGAAAAT TTTCTGTCAT 540 

CTCTTCGTTA TTAATGTTTG TAATTGACTG 7ATATCAACG CTTATTTAAA TCAGACTGAA 600 

GACTTATCTC TCTCTGTCAT AAAACTGTCA TATTCCTTAC ATATAACTGT CACCTGTTTG 660 

TCCTATTTTG CTTGTCGTAG CCAACAAACA ATGCTTTATG AATCCTCCCA GGAGACATTA 720 

TGAAAGTTAT GCGTACCACC GTCGCAACTG TTGTCGCCGC GACCTTATCG ACCTGCAGGC 780 

ATGCAAGCTT GGCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC CCTGGCGTTA 840 



19 



CCCAACTTAA 


TCGCCTTGCA 


GCACATCCCC 


CTTTCGCCAG 


CTGGCGTAAT 


AGCGAAGAGG 


900 


CCCGCACCGA 


TCGCCCTTCC 


CAACAGTTGC 


GCAGCCTGAA TGGCGAATGG 


CGCCTGATGC 


960 


GGTATTTTCT 


CCTTACGCAT 


CTGTGCGGTA 


TTTCACACCG 


CATATGGTGC 


ACTCTCAGTA 


1020 


CAATCTGCTC 


TGATGCCGCA 


TAGTTAAGCC 


AGCCCCGACA 


CCCGCCAACA 


CCCGCTGACG 


1080 


CGCCCTGACG 


GGCTTGTCTG 


CTCCCGGCAT 


CCGCTTACAG 


ACAAGCTGTG 


ACCGTCTCCX3 


1140 


GGAGCTGCAT 


GTGTCAGAGG 


TTTTCACCGT 


CATCACCGAA 


ACGCGCGAGA 


CGAAAGGGCC 


1200 


TCGTGATACG 


CCTATTTTTA 


TAGGTTAATG 


TCATGATAAT 


AATGGTTTCT 


TAGACGTCAG 


1260 


GTGGCACTTT 


TCGGGGAAAT 


GTGCGCGGAA 


CCCCTATTTG 


TTTATTTTTC 


TAAATACATT 


1320 


CAAATATGTA 


TCCGCTCATG 


AGACAATAAC 


CCTGATAAAT 


GCTTCAATAA 


TATTGAAAAA 


1380 


GGAAGAGTAT 


GAGTATTCAA 


CATTTCCGTG 


TCGCCCTTAT 


TCCCTTTTTT 


GCGGCATTTT 


1440 


GCCTTCCTGT 


TTTTGCTCAC 


CCAGAAACGC 


TGGTGAAAGT 


AAAAGATGCT 


GAAGATCAGT 


1500 


TGGGTGCACG 


AGTGGGTTAC 


ATCGAACTGG 


ATCTCAACAG 


CGGTAAGATC 


CTTGAGAGTT 


1560 


TTCGCCCCGA AGAACGTTTT 


CCAATGATGA 


GCACTTTTAA 


AGTTCTGCTA 


TGTGGCGCGG 


1620 


TATTATCCCG 


TATTGACGCC 


GGGCAAGAGC 


AACTCGGTCG 


CCGCATACAC 


TATTCTCAGA 


1680 


ATGACTTGGT 


TGAGTACTCA 


CCAGTCACAG 


AAAAGCATCT 


TACGGATGGC 


ATGACAGTAA 


1740 


GAGAATTATG 


CAGTGCTGCC 


ATAACCATGA 


GTGATAACAC 


TGCGGCCAAC 


TTACTTCTGA 


1800 


CAACGATCGG 


AGGACCGAAG 


GAGCTAACCG 


CTTTTTTGCA 


CAACATGGGG 


GATCATGTAA 


1860 


CTCGCCTTGA 


TCGTTGGGAA 


CCGGAGCTGA 


ATGAAGCCAT 


ACCAAACGAC 


GAGCGTGACA 


1920 


CCACGATGCC 


TGTAGCAATG 


GCAACAACGT 


TGCGCAAACT 


ATTAACTGGC 


GAACTACTTA 


1980 


CTCTAGCTTC 


CCGGCAACAA 


TTAATAGACT 


GGATGGAGGC 


GGATAAAGTT 


GCAGGACCAC 


2040 


TTCTGCGCTC 


GGCCCTTCCG 


GCTGGCTGGT 


TTATTGCTGA 


TAAATCTGGA 


GCCGGTGAGC 


2100 


GTGGGTCTCG 


CGGTATCATT 


GCAGCACTGG 


GGCCAGATGG 


TAAGCCCTCC 


CGTATCGTAG 


2160 


TTATCTACAC' 


GACGGGGAGT 


CAGGCAACTA 


TGGATGAACG 


AAATAGACAG 


ATCGCTGAGA 


2220 


TAGGTGCCTC 


ACTGATTAAG 


CATTGGTAAC 


TGTCAGACCA 


AGTTTACTCA 


TATATACTTT 


2280 


AGATTGATTT 


AAAACTTCAT 


TTTTAATTTA 


AAAGGATCTA 


GGTGAAGATC 


CTTTTTGATA 


2340 


ATCTCATGAC 


CAAAATCCCT 


TAACGTGAGT 


TTTCGTTCCA 


CTGAGCGTCA 


GACCCCGTAG 


2400 


AAAAGATCAA 


AGGATCTTCT 


TGAGATCCTT 


TTTTTCTGCG 


CGTAATCTGC 


TGCTTGCAAA 


2460 


CAAAAAAACC 


ACCGCTACCA 


GCGGTGGTTT 


GTTTGCCGGA 


TCAAGAGCTA 


CCAACTCTTT 


2520 


TTCCGAAGGT 


AACTGGCTTC 


AGCAGAGCGC 


AGATACCAAA 


TACTGTCCTT 


CTAGTGTAGC 


2580 



CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA 264 0 

TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA 2700 

GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC 2760 

CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA 2820 

GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA 2880 

CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAT^CGCCTG GTATCTTTAT AGTCCT6TCG 2940 

GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC 3000 

TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG 3060 

CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG 3120 

AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG 3180 

AAGCGGAAGA 3^90 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide (NLC95) 
used to analyze products of transposition." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
ATAATCCTTA AAAACTCCAT TTCCACCCCT 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide (NLC209) 
used to analyze products of transposition" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
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GTGATTGCAC CGATCTTCTA CACCGTTCC 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide (NLC429) 
used to analyze products of transposition" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TTTCACCGTC ATCACCGAAA CGCGCGAGAC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide (NLC430) 
used to analyze products of transposition" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

AATGACTTGG TTGAGTACTC ACCAGTCACA 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:' 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide (NLC431) 
used to analyze the products of transpostion" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ATGAACGAAA TAGACAGATC GCTGAGATAG 
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(2) INFORMATION FOR SEQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide (NLC432) 
used to analyze products, of transposition" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
CAAGACGATA GTTACCGGAT AAGGCGCAGC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide (NLC94) 
used for sequence determination" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AAAGTCCAGT ATGCTTTTTC ACAGCATAAC 30 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE': peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Asn Tyr Asn Arg Asn 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 
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(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Asn Tyr Thr Arg Asn 
1 5 
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